SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Ciossek, Thomas
Ullrich, Axel
Millauer, Birgit

(ii) TITLE OF INVENTION: METHODS FOR DIAGNOSIS AND TREATMENT OF MDK1 SIGNAL TRANSDUCTION DISORDERS

(iii) NUMBER OF SEQUENCES: 12

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Lyon & Lyon

(B) STREET: 633 West Fifth Street, Suite 4700

(C) CITY: Los Angeles

(D) STATE: California

(E) COUNTRY: U.S.A.

(F) ZIP: 90071-2066

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: 3.5" Diskette, 1.44 Mb storage

(B) COMPUTER: IBM Compatible

(C) OPERATING SYSTEM: IBM P.C. DOS 5.0

(D) SOFTWARE: Word Perfect 5.1

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 08/368,776

(B) FILING DATE: January 3, 1995

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:

Prior applications total, including application described below: none

(A) APPLICATION NUMBER:

(B) FILING DATE:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Warburg

(B) REGISTRATION NUMBER: 32,327

(C) REFERENCE/DOCKET NUMBER: 208/007

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (213) 41

(B) TELEFAX: (213) 9'

(C) TELEX: 67-3510

(2) INFORMATION FOR SEQUENCE ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4304 base pairs

(B) TYPE: nucleic acid

(C) STRANDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

AAGCGGCCGG TCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA 50 

CGAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGT GGAGCAGACG 100 

GCCTGGGACC CAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA 150

TAATAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGC GACCCAACTG 200 

CAGCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG 250 

TTCCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 300

CACGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG 350 

CACAACAAAC AGAATTGGAA TGGATTTCCT CTCCACCCAG TGGGTGGGAA 400 

GAAATTAGTG GTTTGGATGA GAACTACACT CCGATAAGAA CATACCAGGT 450

GTGCCAGGTC ATGGAGCCCA ACCAGAACAA CTGGCTGCGG ACTAACTGGA 500

TTTCTAAAGG CAACGCACAA AGGATTTTTG TAGAATTGAA ATTCACCTTG 550 

AGGGATTGTA ATAGTCTTCC CGGAGTCCTG GGAACTTGCA AGGAAACGTT 600

TAATTTGTAC TATTATGAAA CAGACTACGA CACCGGCAGG AATATACGAG 650

AAAACCTTTA TGTTAAAATA GACACCATTG CTGCAGATGA AAGTTTCACA 700

CAAGGTGACC TTGGTGAAAG AAAGATGAAG CTGAACACTG AGGTGAGAGA 750 

GATTGGACCT TTGTCCAAAA AGGGATTCTA TCTTGCCTTT CAGGATGTAG 800 

GGGCTTGCAT AGCATTGGTT TCTGTCAAAG TGTACTACAA GAAGTGCTGG 850 

ACCATTGTTG AGAACTTAGC TGTCTTTCCA GATACAGTGA CTGGTTCGGA 900 

ATTTTCCTCC TTAGTCGAGG TCCGTGGGAC ATGTGTCAGC AGTGCCGAGG 950 

AAGAGGCAGA AAATTCCCCC AGAATGCATT GCAGTGCAGA AGGAGAGTGG 1000 

CTAGTACCCA TTGGAAAATG CATCTGCAAA GCAGGCTATC AGCAAAAAGG 1050 

GGACACTTGC GAACCCTGTG GCCGCAGGTT CTACAAATCT TCCTCTCAGG 1100 

ATCTCCAGTG TTCTCGTTGT CCAACCCACA GCTTCTCTGA CCGAGAAGGA 1150 

TCATCCAGGT GTGAATGTGA AGATGGGTAC TACAGAGCTC CTTCTGATCC 1200

ACCATACGTT GCATGCACGA GGCCTCCCTC TGCACCACAG AACCTTATTT 1250 

TCAATATCAA TCAAACGACT GTAAGTTTGG AATGGAGTCC TCCGGCTGAC 1300

AACGGGGGAA GAAACGATGT CACCTACAGA ATACTGTGTA AGCGGTGCAG 1350

TTGGGAACAG GGAGAATGTG TGCCATGCGG AAGTAACATT GGATACATGC 1400

CCCAGCAGAC GGGATTAGAG GATAACTATG TCACTGTCAT GGACCTACTT 1450 

GCCCATGCAA ATTACACTTT CGAAGTTGAA GCTGTAAATG GAGTTTCGGA 1500 

CTTAAGCAGA TCCCAGAGGC TCTTCGCTGC TGTTAGCATC ACCACCGGTC 1550 

AAGCAGCTCC CTCGCAAGTG AGTGGAGTCA TGAAGGAGCG AGTACTGCAG 1600 

CGGAGTGTGC AGCTTTCCTG GCAGGAGCCG GAGCATCCCA ATGGAGTCAT 1650 

CACGGAATAT GAAATCAAGT ATTATGAGAA AGATCAACGG GAAAGGACGT 1700 

ACTCAACACT CAAAACCAAG TCCACCTCCG CCTCCATTAA TAATCTGAAA 1750

CCGGGAACAG TGTACGTCTT TCAGATCCGG GCGGTCACTG CTGCCGGTTA 1800 

TGGAAACTAC AGCCCTAGGC TTGATGTTGC CACACTTGAG GAAGCTTCAG 1850 

GTAAAATGTT TGAAGCGACA GCAGTCTCCA GTGAACAGAA TCCTGTCATC 1900 

ATAATTGCTG TAGTGGCTGT AGCAGGGACC ATCATCTTGG TGTTCATGGT 1950 

GTTCGGCTTC ATCATTGGAA GAAGGCACTG TGGTTATAGC AAGGCTGACC 2000

AAGAAGGGGA TGAAGAACTC TACTTTCATT TTAAATTTCC AGGCACCAAA 2050

ACCTACATTG ACCCTGAAAC CTATGAGGAC CCAAATAGAG CTGTCCATCA 2100

ATTCGCCAAG GAGCTAGATG CCTCCTGTAT TAAAATTGAG CGTGTGATTG 2150

GTGCAGGAGA ATTTGGAGAA GTTTGCAGTG GTCGTTTGAA ACTTCCGGGC 2200

CAGAGAGATG TTGCAGTGGC CATAAAAACC CTGAAAGTTG GTTACACAGA 2250

AAAGCAAAGG AGGGACTTTT TATGCGAAGC AAGCATCATG GGGCAATTTG 2300

ACCACCCAAA TGTCGTCCAT TTGGAAGGGG TTGTTACAAG AGGGAAGCCT 2350

GTCATGATTG TGATAGAGTT CATGGAGAAT GGAGCCCTGG ATGCATTTCT 2400

CAGGAAACAC GATGGGCAGT TTACAGTCAT TCAGTTGGTA GGAATGTTGA 2450

GAGGTATTGC CGCTGGGATG CGATACTTGG CTGATATGGG ATACGTTCAC 2500

AGGGACCTTG CAGCGCGCAA CATCCTTGTC AACAGCAATC TTGTTTGTAA 2550

AGTGTCAGAT TTTGGCCTTT CCCGGGTTAT AGAGGATGAT CCCGAAGCTG 2600

TCTACACCAC GACTGGTGGA AAAATTCCAG TAAGGTGGAC TGCACCGGAA 2650

GCCATTCAAT ACCGGAAGTT CACCTCAGCC AGCGATGTGT GGAGCTATGG 2700

GATTGTCATG TGGGAAGTGA TGTCTTATGG AGAAAGACCT TACTGGGACA 2750

TGTCAAATCA AGATGTCATT AAAGCGATAG AAGAAGGTTA TCGTTTGCCG 2800

GCGCCCATGG ATTGCCCAGC TGGTCTTCAC CAGCTAATGC TGGATTGTTG 2850

GCAGAAAGAT CGGGCGGAAA GGCCAAAGTT TGAGCAGATA GTCGGAATTC 2900

TAGACAAAAT GATTCGAAAC CCAAGTAGTC TGAAAACACC CCTGGGAACT 2950

TGTAGTAGAC CCTTAAGCCC TCTTCTGGAC CAGAGCACTC CTGACTTCAC 3000

TGCCTTCTGT TCAGTTGGAG AATGGTTGCA AGCTATTAAA ATGGAAAGGT 3050

ATAAGGACAA CTTCACAGCA GCGGGTTACA ACTCACTCGA GTCAGTGGCC 3100

AGGATGACTA TCGATGATGT GATGAGTTTA GGGATCACAC TGGTTGGCCA 3150

TCAAAAGAAG ATCATGAGCA GCATCCAGAC TATGCGGGCA CAAATGTTGC 3200

ATTTACACGG AACAGGCATC CAAGTGTGAC ACATCGGCCT CCCTCAGATG 3250

AGGCTTAAGA GTGCAGGAGA ACAGTTCTGG CCTTCAGTAT ACGCATAGAA 3300

TGCTGCTAGA AGACAGTTGA TATACTGGGT CCTTCCTACA AGAAAGAGAA 3350


GATTTTAGAA GCACCTCCAG ACTTGAACTC CTAAGTGCCA CCAGAATATA 3400

CAAAAAGGGA ATTTAGGATC CACCACTGGT GGCCAGGAAC ACAGCAGAGA 3450

CAATAAACAA AGTACTACCT GAAAAACATC CCAACACCTT GAGCTCTCGA 3500

ACCTCCTTTT TATCTTATAG ACTTTTTAAA AATGTACATA AAGAATTTAA 3550

GAAAGAATAT ATTTGTCAAA TAAAAATCAT GATCTTATTG TTAAAATCAA 3600

TGAAATATTT TCCTTAAAAT ATGTGATTTC AGACTATTCT TTTCCAGAAC 3650

CATCTGTGTT TATTCTGCTT AAGGACTTTG TTTTAGAAAG TTATTTGTAG 3700

CTTTGGACCT TTTTAGTGTT AAATTTATGA CACGTTACTA CACTGGGAAC 3750

CTTTGAAGAC TCTCAAACTT AAAGGAAAGC AAAACTACGC ACATAGTCGA 3800

GGATGGACTT TGTCCTTCAT GGCTTTGGTA TCCTGGCTGT rtTrai-TT'Tr'T 3850

TAAACCAGTG ATGTTTTCAT ATTGTTTGCT GATTGGCAGG TAGTTCAAAA 3900

TTGCAAGTTG CCAAGAGCTC TGATATTTTT TAACAGGATT TTTTTTTCTT 3950

TGTAAAAATC AGATAACATA CTAACTTTTC AATGAAAAAA AAAAAAAAAG 4000

AAGCAATAAT GATCCATAAA TACTATAAGG CACTTTTAAC AGATTGTTTA 4050

TAGAGTGATT TACTAGGCAG AATTTAATAA AAAAAAAAGA GAGATGTCAA 4100

ATTTTAGGTT TATGTGTATA TGATAAAAGG CTGAGCTTCG TCTGAAGATG 4150

CTGGTGAAAG CAAGACTGGA AGCGAAGCTC TCCAGCTTTG GCTAACCCAA 4200

TCCGAGCACA TCAAGAGCTT CAGTCTTGTG ACAGTAAGAA ATTTAGGAAC 4250

ATAGTTGACC TATATTTTGT ATTCTTTCTT GTTGAATGCA GTCCAAATAC 4300

AAAA 4304



(2) INFORMATION FOR SEQUENCE ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 998 amino acids 

(B) TYPE: amino acid 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Val Val Gln Thr Arg Phe Pro Ser Trp Ile Ile Leu Cys Tyr Ile
1 5 10 15

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gln Ala Ala Lys Glu
20 25 30

Val Leu Leu Leu Asp Ser Lys Ala Gln Gln Thr Glu Leu Glu Trp Ile
35 40 45

Ser Ser Pro Pro Ser Gly Trp Glu Glu Ile Ser Gly Leu Asp Glu Asn
50 55 60

Tyr Thr Pro Ile Arg Thr Tyr Gln Val Cys Gln Val Met Glu Pro Asn
65 70 75 80

Gln Asn Asn Trp Leu Arg Thr Asn Trp Ile Ser Lys Gly Asn Ala Gln
85 90 95

Arg Ile Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu
100 105 110

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr
115 120 125

Glu Thr Asp Tyr Asp Thr Gly Arg Asn Ile Arg Glu Asn Leu Tyr Val
130 135 140

Lys Ile Asp Thr Ile Ala Ala Asp Glu Ser Phe Thr Gln Gly Asp Leu
145 150 155 160

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu Ile Gly Pro
165 170 175

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gln Asp Val Gly Ala Cys
180 185 190

Ile Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr Ile
195 200 205

Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe
210 215 220

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu
225 230 235 240

Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp
245 250 255

Leu Val Pro Ile Gly Lys Cys Ile Cys Lys Ala Gly Tyr Gln Gln Lys
260 265 270

Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser
275 280 285

Gln Asp Leu Gln Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg
290 295 300

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro
305 310 315 320

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gln
325 330 335

Asn Leu Ile Phe Asn Ile Asn Gln Thr Thr Val Ser Leu Glu Trp Ser
340 345 350

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg Ile Leu
355 360 365

Cys Lys Arg Cys Ser Trp Glu Gln Gly Glu Cys Val Pro Cys Gly Ser
370 375 380

Asn Ile Gly Tyr Met Pro Gln Gln Thr Gly Leu Glu Asp Asn Tyr Val
385 390 395 400

Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu
405 410 415

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gln Arg Leu Phe Ala
420 425 430

Ala Val Ser Ile Thr Thr Gly Gln Ala Ala Pro Ser Gln Val Ser Gly
435 440 445

Val Met Lys Glu Arg Val Leu Gln Arg Ser Val Gln Leu Ser Trp Gln
450 455 460

Glu Pro Glu His Pro Asn Gly Val Ile Thr Glu Tyr Glu Ile Lys Tyr
465 470 475 480

Tyr Glu Lys Asp Gln Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys
485 490 495

Ser Thr Ser Ala Ser Ile Asn Asn Leu Lys Pro Gly Thr Val Tyr Val
500 505 510

Phe Gln Ile Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro
515 520 525

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu
530 535 540

Ala Thr Ala Val Ser Ser Glu Gln Asn Pro Val Ile Ile Ile Ala Val
545 550 555 560

Val Ala Val Ala Gly Thr Ile Ile Leu Val Phe Met Val Phe Gly Phe
565 570 575

Ile Ile Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gln Glu Gly
580 585 590

Asp Glu Glu Leu Tyr Phe His Phe Lys Phe Pro Gly Thr Lys Thr Tyr
595 600 605

Ile Asp Pro Glu Thr Tyr Glu Asp Pro Asn Arg Ala Val His Gln Phe
610 615 620

Ala Lys Glu Leu Asp Ala Ser Cys Ile Lys Ile Glu Arg Val Ile Gly
625 630 635 640

Ala Gly Glu Phe Gly Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly
645 650 655

Gln Arg Asp Val Ala Val Ala Ile Lys Thr Leu Lys Val Gly Tyr Thr
660 665 670

Glu Lys Gln Arg Arg Asp Phe Leu Cys Glu Ala Ser Ile Met Gly Gln
675 680 685

Phe Asp His Pro Asn Val Val His Leu Glu Gly Val Val Thr Arg Gly
690 695 700

Lys Pro Val Met Ile Val Ile Glu Phe Met Glu Asn Gly Ala Leu Asp
705 710 715 720

Ala Phe Leu Arg Lys His Asp Gly Gln Phe Thr Val Ile Gln Leu Val
725 730 735

Gly Met Leu Arg Gly Ile Ala Ala Gly Met Arg Tyr Leu Ala Asp Met
740 745 750

Gly Tyr Val His Arg Asp Leu Ala Ala Arg Asn Ile Leu Val Asn Ser
755 760 765

Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Val Ile Glu
770 775 780

Asp Asp Pro Glu Ala Val Tyr Thr Thr Thr Gly Gly Lys Ile Pro Val
785 790 795 800

Arg Trp Thr Ala Pro Glu Ala Ile Gln Tyr Arg Lys Phe Thr Ser Ala
805 810 815

Ser Asp Val Trp Ser Tyr Gly Ile Val Met Trp Glu Val Met Ser Tyr
820 825 830

Gly Glu Arg Pro Tyr Trp Asp Met Ser Asn Gln Asp Val Ile Lys Ala
835 840 845

Ile Glu Glu Gly Tyr Arg Leu Pro Ala Pro Met Asp Cys Pro Ala Gly
850 855 860

Leu His Gln Leu Met Leu Asp Cys Trp Gln Lys Asp Arg Ala Glu Arg
865 870 875 880

Pro Lys Phe Glu Gln Ile Val Gly Ile Leu Asp Lys Met Ile Arg Asn
885 890 895

Pro Ser Ser Leu Lys Thr Pro Leu Gly Thr Cys Ser Arg Pro Leu Ser
900 905 910

Pro Leu Leu Asp Gln Ser Thr Pro Asp Phe Thr Ala Phe Cys Ser Val
915 920 925

Gly Glu Trp Leu Gln Ala Ile Lys Met Glu Arg Tyr Lys Asp Asn Phe
930 935 940

Thr Ala Ala Gly Tyr Asn Ser Leu Glu Ser Val Ala Arg Met Thr Ile
945 950 955 960

Asp Asp Val Met Ser Leu Gly Ile Thr Leu Val Gly His Gln Lys Lys
965 970 975

Ile Met Ser Ser Ile Gln Thr Met Arg Ala Gln Met Leu His Leu His
980 985 990

Gly Thr Gly Ile Gln Val
995



(2) INFORMATION FOR SEQUENCE ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 amino acids 

(B) TYPE: amino acid 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Val Val Gln Thr Arg Phe Pro Ser Trp Ile Ile Leu Cys Tyr Ile
1 5 10 15

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gln Ala Ala Lys Glu
20 25 30

Val Leu Leu Leu Asp Ser Lys Ala Gln Gln Thr Glu Leu Glu Trp Ile
35 40 45

Ser Ser Pro Pro Ser Gly Trp Glu Glu Ile Ser Gly Leu Asp Glu Asn
50 55 60

Tyr Thr Pro Ile Arg Thr Tyr Gln Val Cys Gln Val Met Glu Pro Asn
65 70 75 80

Gln Asn Asn Trp Leu Arg Thr Asn Trp Ile Ser Lys Gly Asn Ala Gln
85 90 95

Arg Ile Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu
100 105 110

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr
115 120 125

Glu Thr Asp Tyr Asp Thr Gly Arg Asn Ile Arg Glu Asn Leu Tyr Val
130 135 140

Lys Ile Asp Thr Ile Ala Ala Asp Glu Ser Phe Thr Gln Gly Asp Leu
145 150 155 160

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu Ile Gly Pro
165 170 175

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gln Asp Val Gly Ala Cys
180 185 190

Ile Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr Ile
195 200 205

Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe
210 215 220

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu
225 230 235 240

Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp
245 250 255

Leu Val Pro Ile Gly Lys Cys Ile Cys Lys Ala Gly Tyr Gln Gln Lys
260 265 270

Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser
275 280 285

Gln Asp Leu Gln Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg
290 295 300

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro
305 310 315 320

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gln
325 330 335

Asn Leu Ile Phe Asn Ile Asn Gln Thr Thr Val Ser Leu Glu Trp Ser
340 345 350

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg Ile Leu
355 360 365

Cys Lys Arg Cys Ser Trp Glu Gln Gly Glu Cys Val Pro Cys Gly Ser
370 375 380

Asn Ile Gly Tyr Met Pro Gln Gln Thr Gly Leu Glu Asp Asn Tyr Val
385 390 395 400

Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu
405 410 415

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gln Arg Leu Phe Ala
420 425 430

Ala Val Ser Ile Thr Thr Gly Gln Ala Ala Pro Ser Gln Val Ser Gly
435 440 445

Val Met Lys Glu Arg Val Leu Gln Arg Ser Val Gln Leu Ser Trp Gln
450 455 460

Glu Pro Glu His Pro Asn Gly Val Ile Thr Glu Tyr Glu Ile Lys Tyr
465 470 475 480

Tyr Glu Lys Asp Gln Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys
485 490 495

Ser Thr Ser Ala Ser Ile Asn Asn Leu Lys Pro Gly Thr Val Tyr Val
500 505 510

Phe Gln Ile Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro
515 520 525

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu
530 535 540

Ala Thr Ala Val Ser Ser Glu Gln Asn Pro Val Ile Ile Ile Ala Val
545 550 555 560

Val Ala Val Ala Gly Thr Ile Ile Leu Val Phe Met Val Phe Gly Phe
565 570 575

Ile Ile Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gln Glu Gly
580 585 590

Asp Glu Glu Leu Tyr Phe His Ser Leu Val Thr Asn Glu His Leu Ser
595 600 605

Val Leu
610

(2) INFORMATION FOR SEQUENCE ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2901 base pairs

(B) TYPE: nucleic acid

(C) STRANDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
AAGCGGCCGG TCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA 50

CGAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGT GGAGCAGACG 100

GCCTGGGACC CAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA 150

TAATAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGC GACCCAACTG 200

CAGCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG 250

TTCCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 300

CACGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG 350

CACAACAAAC AGAATTGGAA TGGATTTCCT CTCCACCCAG TGGGTGGGAA 400

GAAATTAGTG GTTTGGATGA GAACTACACT CCGATAAGAA CATACCAGGT 450

GTGCCAGGTC ATGGAGCCCA ACCAGAACAA CTGGCTGCGG ACTAACTGGA 500

TTTCTAAAGG CAACGCACAA AGGATTTTTG TAGAATTGAA ATTCACCTTG 550

AGGGATTGTA ATAGTCTTCC CGGAGTCCTG GGAACTTGCA AGGAAACGTT 600

TAATTTGTAC TATTATGAAA CAGACTACGA CACCGGCAGG AATATACGAG 650

AAAACCTTTA TGTTAAAATA GACACCATTG CTGCAGATGA AAGTTTCACA 700

CAAGGTGACC TTGGTGAAAG AAAGATGAAG CTGAACACTG AGGTGAGAGA 750

GATTGGACCT TTGTCCAAAA AGGGATTCTA TCTTGCCTTT CAGGATGTAG 800

GGGCTTGCAT AGCATTGGTT TCTGTCAAAG TGTACTACAA GAAGTGCTGG 850

ACCATTGTTG AGAACTTAGC TGTCTTTCCA GATACAGTGA CTGGTTCGGA 900

ATTTTCCTCC TTAGTCGAGG TCCGTGGGAC ATGTGTCAGC AGTGCCGAGG 950

AAGAGGCAGA AAATTCCCCC AGAATGCATT GCAGTGCAGA AGGAGAGTGG 1000

CTAGTACCCA TTGGAAAATG CATCTGCAAA GCAGGCTATC AGCAAAAAGG 1050

GGACACTTGC GAACCCTGTG GCCGCAGGTT CTACAAATCT TCCTCTCAGG 1100

ATCTCCAGTG TTCTCGTTGT CCAACCCACA GCTTCTCTGA CCGAGAAGGA 1150

TCATCCAGGT GTGAATGTGA AGATGGGTAC TACAGAGCTC CTTCTGATCC 1200

ACCATACGTT GCATGCACGA GGCCTCCCTC TGCACCACAG AACCTTATTT 1250

TCAATATCAA TCAAACGACT GTAAGTTTGG AATGGAGTCC TCCGGCTGAC 1300

AACGGGGGAA GAAACGATGT CACCTACAGA ATACTGTGTA AGCGGTGCAG 1350

TTGGGAACAG GGAGAATGTG TGCCATGCGG AAGTAACATT GGATACATGC 1400

CCCAGCAGAC GGGATTAGAG GATAACTATG TCACTGTCAT GGACCTACTT 1450

GCCCATGCAA ATTACACTTT CGAAGTTGAA GCTGTAAATG GAGTTTCGGA 1500

CTTAAGCAGA TCCCAGAGGC TCTTCGCTGC TGTTAGCATC ACCACCGGTC 1550

AAGCAGCTCC CTCGCAAGTG AGTGGAGTCA TGAAGGAGCG AGTACTGCAG 1600

CGGAGTGTGC AGCTTTCCTG GCAGGAGCCG GAGCATCCCA ATGGAGTCAT 1650

CACGGAATAT GAAATCAAGT ATTATGAGAA AGATCAACGG GAAAGGACGT 1700

ACTCAACACT CAAAACCAAG TCCACCTCCG CCTCCATTAA TAATCTGAAA 1750

CCGGGAACAG TGTACGTCTT TCAGATCCGG GCGGTCACTG CTGCCGGTTA 1800

TGGAAACTAC AGCCCTAGGC TTGATGTTGC CACACTTGAG GAAGCTTCAG 1850

GTAAAATGTT TGAAGCGACA GCAGTCTCCA GTGAACAGAA TCCTGTCATC 1900

ATAATTGCTG TAGTGGCTGT AGCAGGGACC ATCATCTTGG TGTTCATGGT 1950

GTTCGGCTTC ATCATTGGAA GAAGGCACTG TGGTTATAGC AAGGCTGACC 2000

AAGAAGGGGA TGAAGAACTC TACTTTCATT CTTTAGTAAC AAATGAGCAC 2050

CTGTCAGTTT TATAAACCGC AACAATAACT GTTTAAGACA ATCAATTTTG 2100

GATAAACAAT CAACTACAGC AGAATAAATC AAGATTTTTA AGTCCCATTT 2150

TCCTTTATAC ATTCTGCTTA TTTTGTTGTT ATATGTTTAT TTTTTAAACT 2200

CTGATCTTGA TTGTATGTGA TACCATAAGC ACAGTTAGGC TGCAGTGTAA 2250

ATATATAAAG ACATTGTTCT GAGAGCAGTA CGATTTCATG GAAAGATTGT 2300

TTGGTGGCTT TGTTAAAATT AATAAAGAAT TTTTAAGGAT ATAGTGTAAT 2350

TTTCTTCATT GCATTAATAT AACCAAATAT GCCTACCTAT CTTTGTCTTG 2400

AACCAAATGA ATAGATTTGG AATACTTTAT TGTAATTGAA TTTGATATAA 2450

AGTTGACTGA GCATTTATGT GTTACCTGCA TGCTTCTGGG TGCATTGAAA 2500

TATTTTAACT TTTAAAATGA TACTATGTTG TTTCAATTTT GACTACCTTT 2550

TGTGAGGCAT ACTGGCTACC TCCTCCTATT AGCTAAGATC TTCCAAAGCC 2600

TTATAATGAA AAGTTTATAT AAACCATTTC TCTTTCAAAT CACTGTCATA 2650

CTTGGTCACG GATCCCAGGA ATATTGTAAA TTTTCTAATT TACTCTGCAC 2700

TTTGTATATC CAGCCTCTAT TACCCTCAAG GTGAATATAA AACTATGTCT 2750

TTTGAATATT TCTCTTTGAT TTTGTGATAG CAGTCCCTCA TATCTTGTAC 2800

TAATTTTATG TATATGTCAA CAGTGGTTGG TCTTTAAAAA TAAATCAAAG 2850

AATAAGTAAA AAAAAAAAAA AAAAAAAAAA AAAAATAAAA AAAAAAAAAA 2900

A 2901



(2) INFORMATION FOR SEQUENCE ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 626 amino acids 

(B) TYPE: amino acid 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Val Val Gln Thr Arg Phe Pro Ser Trp Ile Ile Leu Cys Tyr Ile
1 5 10 15

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gln Ala Ala Lys Glu
20 25 30

Val Leu Leu Leu Asp Ser Lys Ala Gln Gln Thr Glu Leu Glu Trp Ile
35 40 45

Ser Ser Pro Pro Ser Gly Trp Glu Glu Ile Ser Gly Leu Asp Glu Asn
50 55 60

Tyr Thr Pro Ile Arg Thr Tyr Gln Val Cys Gln Val Met Glu Pro Asn
65 70 75 80

Gln Asn Asn Trp Leu Arg Thr Asn Trp Ile Ser Lys Gly Asn Ala Gln
85 90 95

Arg Ile Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu
100 105 110

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr
115 120 125

Glu Thr Asp Tyr Asp Thr Gly Arg Asn Ile Arg Glu Asn Leu Tyr Val
130 135 140

Lys Ile Asp Thr Ile Ala Ala Asp Glu Ser Phe Thr Gln Gly Asp Leu
145 150 155 160

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu Ile Gly Pro
165 170 175

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gln Asp Val Gly Ala Cys
180 185 190

Ile Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr Ile
195 200 205

Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe
210 215 220

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu
225 230 235 240

Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp
245 250 255

Leu Val Pro Ile Gly Lys Cys Ile Cys Lys Ala Gly Tyr Gln Gln Lys
260 265 270

Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser
275 280 285

Gln Asp Leu Gln Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg
290 295 300

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro
305 310 315 320

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gln
325 330 335

Asn Leu Ile Phe Asn Ile Asn Gln Thr Thr Val Ser Leu Glu Trp Ser
340 345 350

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg Ile Leu
355 360 365

Cys Lys Arg Cys Ser Trp Glu Gln Gly Glu Cys Val Pro Cys Gly Ser
370 375 380

Asn Ile Gly Tyr Met Pro Gln Gln Thr Gly Leu Glu Asp Asn Tyr Val
385 390 395 400

Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu
405 410 415

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gln Arg Leu Phe Ala
420 425 430

Ala Val Ser Ile Thr Thr Gly Gln Ala Ala Pro Ser Gln Val Ser Gly
435 440 445

Val Met Lys Glu Arg Val Leu Gln Arg Ser Val Gln Leu Ser Trp Gln
450 455 460

Glu Pro Glu His Pro Asn Gly Val Ile Thr Glu Tyr Glu Ile Lys Tyr
465 470 475 480

Tyr Glu Lys Asp Gln Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys
485 490 495

Ser Thr Ser Ala Ser Ile Asn Asn Leu Lys Pro Gly Thr Val Tyr Val
500 505 510

Phe Gln Ile Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro
515 520 525

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu
530 535 540

Ala Thr Ala Val Ser Ser Glu Gln Asn Pro Val Ile Ile Ile Ala Val
545 550 555 560

Val Ala Val Ala Gly Thr Ile Ile Leu Val Phe Met Val Phe Gly Phe
565 570 575

Ile Ile Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gln Glu Gly
580 585 590

Asp Glu Glu Leu Tyr Phe His Ser Leu Tyr Arg Glu Arg Gly Asp Gly
595 600 605

Met Glu Lys Thr Gln His Asn Lys Lys Trp Met Ile Ala Ser Cys Ser
610 615 620

Arg Leu
625



(2) INFORMATION FOR SEQUENCE ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2323 base pairs

(B) TYPE: nucleic acid

(C) STRANDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

AAGCGGCCGG TCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA 50 

CGAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGT GGAGCAGACG 100

GCCTGGGACC CAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA 150 

TAATAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGC GACCCAACTG 200 

CAGCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG 250 

TTCCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 300 

CACGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG 350

CACAACAAAC AGAATTGGAA TGGATTTCCT CTCCACCCAG TGGGTGGGAA 400 

GAAATTAGTG GTTTGGATGA GAACTACACT CCGATAAGAA CATACCAGGT 450 

GTGCCAGGTC ATGGAGCCCA ACCAGAACAA CTGGCTGCGG ACTAACTGGA 500 

TTTCTAAAGG CAACGCACAA AGGATTTTTG TAGAATTGAA ATTCACCTTG 550 

AGGGATTGTA ATAGTCTTCC CGGAGTCCTG GGAACTTGCA AGGAAACGTT 600 

TAATTTGTAC TATTATGAAA CAGACTACGA CACCGGCAGG AATATACGAG 650 

AAAACCTTTA TGTTAAAATA GACACCATTG CTGCAGATGA AAGTTTCACA 700 

CAAGGTGACC TTGGTGAAAG AAAGATGAAG CTGAACACTG AGGTGAGAGA 750 

GATTGGACCT TTGTCCAAAA AGGGATTCTA TCTTGCCTTT CAGGATGTAG 800 

GGGCTTGCAT AGCATTGGTT TCTGTCAAAG TGTACTACAA GAAGTGCTGG 850 

ACCATTGTTG AGAACTTAGC TGTCTTTCCA GATACAGTGA CTGGTTCGGA 900

ATTTTCCTCC TTAGTCGAGG TCCGTGGGAC ATGTGTCAGC AGTGCCGAGG 950 

AAGAGGCAGA AAATTCCCCC AGAATGCATT GCAGTGCAGA AGGAGAGTGG 1000

CTAGTACCCA TTGGAAAATG CATCTGCAAA GCAGGCTATC AGCAAAAAGG 1050 

GGACACTTGC GAACCCTGTG GCCGCAGGTT CTACAAATCT TCCTCTCAGG 1100 

ATCTCCAGTG TTCTCGTTGT CCAACCCACA GCTTCTCTGA CCGAGAAGGA 1150 

TCATCCAGGT GTGAATGTGA AGATGGGTAC TACAGAGCTC CTTCTGATCC 1200 

ACCATACGTT GCATGCACGA GGCCTCCCTC TGCACCACAG AACCTTATTT 1250 

TCAATATCAA TCAAACGACT GTAAGTTTGG AATGGAGTCC TCCGGCTGAC 1300

AACGGGGGAA GAAACGATGT CACCTACAGA ATACTGTGTA AGCGGTGCAG 1350

TTGGGAACAG GGAGAATGTG TGCCATGCGG AAGTAACATT GGATACATGC 1400

CCCAGCAGAC GGGATTAGAG GATAACTATG TCACTGTCAT GGACCTACTT 1450

GCCCATGCAA ATTACACTTT CGAAGTTGAA GCTGTAAATG GAGTTTCGGA 1500

CTTAAGCAGA TCCCAGAGGC TCTTCGCTGC TGTTAGCATC ACCACCGGTC 1550

AAGCAGCTCC CTCGCAAGTG AGTGGAGTCA TGAAGGAGCG AGTACTGCAG 1600

CGGAGTGTGC AGCTTTCCTG GCAGGAGCCG GAGCATCCCA ATGGAGTCAT 1650

CACGGAATAT GAAATCAAGT ATTATGAGAA AGATCAACGG GAAAGGACGT 1700

ACTCAACACT CAAAACCAAG TCCACCTCCG CCTCCATTAA TAATCTGAAA 1750

CCGGGAACAG TGTACGTCTT TCAGATCCGG GCGGTCACTG CTGCCGGTTA 1800

TGGAAACTAC AGCCCTAGGC TTGATGTTGC CACACTTGAG GAAGCTTCAG 1850

GTAAAATGTT TGAAGCGACA GCAGTCTCCA GTGAACAGAA TCCTGTCATC 1900

ATAATTGCTG TAGTGGCTGT AGCAGGGACC ATCATCTTGG TGTTCATGGT 1950

GTTCGGCTTC ATCATTGGAA GAAGGCACTG TGGTTATAGC AAGGCTGACC 2000

AAGAAGGGGA TGAAGAACTC TACTTTCATT CTCTTTACAG GGAAAGGGGA 2050

GACGGGATGG AAAAGACACA GCACAATAAG AAGTGGATGA TTGCATCGTG 2100

CTCTCGTTTG TAGGTCTCTT TTCCTAATCA ACACTATGAT TTTGAAGTAC 2150

GCGTACACGA AGCAAACGGG AAGAGATAAG GAATTAGCAT TGTGAACCTG 2200

ACTGTAATCC TCTCTTCCGG AAAGAGATGA GATGCTATTG CGATGAGAAT 2250

GTACAACTTG CACCTTGAAA TCTTTTTTGA TAATTAGTGC TCAGGGGAGG 2300

GGGGGGGAAG TAGAGAAAGC AAA 2323

(2) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Ala Ala Thr Ala Ala Ala 
5 



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Ala Ala Thr Ala Ala Ala 

5 



(2) INFORMATION FOR SEQ ID NO: 9:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

His Arg Asp Leu Ala Ala

5 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 2 is valine or methionine; Xaa in position 5 is phenylalanine or tyrosine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Xaa Trp Ser Xaa Gly 
5 
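
The FEATURE entry for SEQ ID NO: 10 describes a degenerate hexapeptide rather than a single sequence. As a minimal illustrative sketch (not part of the original listing), the following Python snippet enumerates the concrete peptides covered by that entry. The names `MOTIF` and `expand_motif` are hypothetical helpers introduced here only for illustration; the residue alternatives are taken directly from the FEATURE entry above.

```python
from itertools import product

# Residues of SEQ ID NO: 10 in three-letter code. Each tuple lists the
# residues allowed at that position; the two Xaa positions carry the
# alternatives stated in the (ix) FEATURE entry.
MOTIF = [
    ("Asp",),
    ("Val", "Met"),  # Xaa in position 2: valine or methionine
    ("Trp",),
    ("Ser",),
    ("Phe", "Tyr"),  # Xaa in position 5: phenylalanine or tyrosine
    ("Gly",),
]


def expand_motif(motif):
    """Return every concrete peptide covered by a degenerate motif."""
    return [" ".join(choice) for choice in product(*motif)]


if __name__ == "__main__":
    for peptide in expand_motif(MOTIF):
        print(peptide)
```

Two positions with two alternatives each give four concrete hexapeptides.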



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Val Val Gin Thr Arg Phe Pro Ser Trp He He Leu Cys Tyr He 
15 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp He 
3S 40 45 

Ser Ser Pro Pro Ser Gly Trp Glu Glu He Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro He Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 

70 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp He Ser Lys Gly Asn Ala Gin 
85 90 95 

Arg He Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tvr 
115 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn He Arg Glu Asn Leu Tyr Val 
130 135 140 
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Lys He Asp Thr He Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 

Gly Glu Arg Lys Met Lye Leu Asn Thr Qlu Val Arg Glu He Gly Pro 
165 170 

Leu ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 
He Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr He 



205 



Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 

, 215 220 

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 

"° 235 240 

Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 
245 250 ' 255 

Leu Val Pro He Gly Lys Cys He Cys Lys Ala Gly Tyr Gin Gin Lys 
260 265 270 

Gly Asp Thr cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 

280 285 

Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 

295 300 ^ 

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 

330 335 

^Asn Leu He Phe Asn He Asn Gin Thr Thr Val Ser Leu Glu Trp Ser 

345 350 

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg He Leu 

360 

Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys Gly Ser 

375 390 

Asn He Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 

395 

Thr val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 
405 

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 
420 425 430 
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Ala Val Ser lie Thr Thr Gly Gin Ala Ala Pro Se. Gin Val Ser Gly 
Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Gin Leu Ser Trp Gin 

460 

Glu Pro Glu His Pro Asn Gly Val lie Thr Glu Tyr Glu He i,ye Tyr 

^"^^ 480 
Tyr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lye 

490 

ser T^r Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 

Phe Gin lie Arg Ala Val Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 

525 

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Ala Thr Ala Val Ser 

535 

ser Glu Gin Asn Pro Val He lie He Ala Val Val Ala Val Ala Gly 

560 

Thr He He Leu Val Phe Met Val Phe Gly Phe He He Gly Arg Arg 

Hia Cys Gly Tyr Ser Lys Ala Asp Gin Glu Gly Asp Glu Glu Leu Tyr 

585 

Phe His Phe Lys Phe Pro Gly Thr Lys Thr Tyr He Asp Pro Glu Thr 

600 

Tyr Glu Asp Pro Asn Arg Ala Val His Gin Phe Ala Lys Glu Leu Asp 



S90 

Lys Thr Tyr He Asp 

600 

Val His Gin Phe Ala 
615 g20 

Ala Ser Cys Ile Lys Ile Glu Arg Val Ile Gly Ala Gly Glu Phe Gly
625 630 635 640

Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly Gln Arg Asp Val Ala
645 650 655

Val Ala Ile Lys Thr Leu Lys Val Gly Tyr Thr Glu Lys Gln Arg Arg
660 665 670

Asp Phe Leu Cys Glu Ala Ser Ile Met Gly Gln Phe Asp His Pro Asn
675 680 685

Val Val His Leu Glu Gly Val Val Thr Arg Gly Lys Pro Val Met Ile
690 695 700

Val Ile Glu Phe Met Glu Asn Gly Ala Leu Asp Ala Phe Leu Arg Lys
705 710 715 720

Hi. ».P 01, =X„

Ala Oly ^ ^„ ^ 2= ^ 

^ Ala Ala ^ ^„ v.l ^„ ,„ ,„ 

V.1 3.J .»p pHa =1, ^ , ^ ^ 

780 

v.1 ^ TH, ™. Th. 01, .y. XI. v.1 A« ^ ^„ 

800 

Glu Ala Ile Gln Tyr Arg Lys Phe Thr Ser Ala Ser Asp Val Trp Ser
805 810 815

'V. oly 11. v.1 v.1 ^ ^ ^ 

830 

Trp Asp Met Ser Asn Gln Asp Val Ile Lys Ala Ile Glu Glu Gly

845

Arg Leu Pro Ala Pro Met Asp Cys Pro Ala Gly Leu His Gln Leu Met
Leu Asp Cys Trp Gln Lys Asp Ala .1. Ar. IZ p,,

880 

v.1 01, II. J.„ ^ ^^^^ 

xhr P.O ^ ^. ^ 21 

910 

Ser Thr Pro Asp Phe Thr Ala Phe Cys Ser Val Gly Glu Trp Leu Gln

925

Ala Ile Lys Met Glu Arg Tyr Lys Asp Asn Phe Thr Ala Ala Gly Tyr

Asn Ser Leu Glu Ser Val Ala Arg Met Thr Ile Asp Asp Val Met Ser

960 

01, 11. ^ v.1 01, „la .!„ ^.^^ 
01„ Th, „.t Aj, M. 

990 

Val

(2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 994 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Val Val Gln Thr Arg Phe Pro Ser Trp Ile Ile Leu Cys Tyr Ile
1 5 10 15

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gln Ala Ala Lys Glu
20 25 30

Val Leu Leu Leu Asp Ser Lys Ala Gln Gln Thr Glu Leu Glu Trp Ile
35 40 45

Ser Ser Pro Pro Ser Gly Trp Glu Glu Ile Ser Gly Leu Asp Glu Asn
50 55 60

Tyr Thr Pro Ile Arg Thr Tyr Gln Val Cys Gln Val Met Glu Pro Asn
65 70 75 80

Gln Asn Asn Trp Leu Arg Thr Asn Trp Ile Ser Lys Gly Asn Ala Gln
85 90 95

Arg Ile Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu
100 105 110

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr
115 120 125

Glu Thr Asp Tyr Asp Thr Gly Arg Asn Ile Arg Glu Asn Leu Tyr Val
130 135 140

Lys Ile Asp Thr Ile Ala Ala Asp Glu Ser Phe Thr Gln Gly Asp Leu
145 150 155 160

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu Ile Gly Pro
165 170 175

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gln Asp Val Gly Ala Cys
180 185 190

Ile Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr Ile
195 200 205

Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe
210 215 220

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu
225 230 235 240

Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp
245 250 255

Leu Val Pro Ile Gly Lys Cys Ile Cys Lys Ala Gly Tyr Gln Gln Lys
260 265 270

Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser
275 280 285

Gln Asp Leu Gln Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg
290 295 300

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro
305 310 315 320

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gln
325 330 335

Asn Leu Ile Phe Asn Ile Asn Gln Thr Thr Val Ser Leu Glu Trp Ser
340 345 350

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg Ile Leu
355 360 365

Cys Lys Arg Cys Ser Trp Glu Gln Gly Glu Cys Val Pro Cys Gly Ser
370 375 380

Asn Ile Gly Tyr Met Pro Gln Gln Thr Gly Leu Glu Asp Asn Tyr Val
385 390 395 400

Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu
405 410 415

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gln Arg Leu Phe Ala
420 425 430

Ala Val Ser Ile Thr Thr Gly Gln Ala Ala Pro Ser Gln Val Ser Gly
435 440 445

Val Met Lys Glu Arg Val Leu Gln Arg Ser Val Gln Leu Ser Trp Gln
450 455 460

Glu Pro Glu His Pro Asn Gly Val Ile Thr Glu Tyr Glu Ile Lys Tyr
465 470 475 480

Tyr Glu Lys Asp Gln Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys
485 490 495

Ser Thr Ser Ala Ser Ile Asn Asn Leu Lys Pro Gly Thr Val Tyr Val
500 505 510

Phe Gln Ile Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro
515 520 525

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu
530 535 540

Ala Thr Ala Val Ser Ser Glu Gln Asn Pro Val Ile Ile Ile Ala Val
545 550 555 560

Val Ala Val Ala Gly Thr Ile Ile Leu Val Phe Met Val Phe Gly Phe
565 570 575

Ile Ile Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gln Glu Gly
580 585 590

Asp Glu Glu Leu Tyr Phe His Cys Thr Lys Thr Tyr Ile Asp Pro Glu
595 600 605

Thr Tyr Glu Asp Pro Asn Arg Ala Val His Gln Phe Ala Lys Glu Leu
610 615 620

Asp Ala Ser Cys Ile Lys Ile Glu Arg Val Ile Gly Ala Gly Glu Phe
625 630 635 640

Gly Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly Gln Arg Asp Val
645 650 655

Ala Val Ala Ile Lys Thr Leu Lys Val Gly Tyr Thr Glu Lys Gln Arg
660 665 670

Arg Asp Phe Leu Cys Glu Ala Ser Ile Met Gly Gln Phe Asp His Pro
675 680 685

Asn Val Val His Leu Glu Gly Val Val Thr Arg Gly Lys Pro Val Met
690 695 700

Ile Val Ile Glu Phe Met Glu Asn Gly Ala Leu Asp Ala Phe Leu Arg
705 710 715 720

Lys His Asp Gly Gln Phe Thr Val Ile Gln Leu Val Gly Met Leu Arg
725 730 735

Gly Ile Ala Ala Gly Met Arg Tyr Leu Ala Asp Met Gly Tyr Val His
740 745 750

Arg Asp Leu Ala Ala Arg Asn Ile Leu Val Asn Ser Asn Leu Val Cys
755 760 765

Lys Val Ser Asp Phe Gly Leu Ser Arg Val Ile Glu Asp Asp Pro Glu
770 775 780

Ala Val Tyr Thr Thr Thr Gly Gly Lys Ile Pro Val Arg Trp Thr Ala
785 790 795 800

Pro Glu Ala Ile Gln Tyr Arg Lys Phe Thr Ser Ala Ser Asp Val Trp
805 810 815

Ser Tyr Gly Ile Val Met Trp Glu Val Met Ser Tyr Gly Glu Arg Pro
820 825 830

Tyr Trp Asp Met Ser Asn Gln Asp Val Ile Lys Ala Ile Glu Glu Gly
835 840 845

Tyr Arg Leu Pro Ala Pro Met Asp Cys Pro Ala Gly Leu His Gln Leu
850 855 860

Met Leu Asp Cys Trp Gln Lys Asp Arg Ala Glu Arg Pro Lys Phe Glu
865 870 875 880

Gln Ile Val Gly Ile Leu Asp Lys Met Ile Arg Asn Pro Ser Ser Leu
885 890 895

Lys Thr Pro Leu Gly Thr Cys Ser Arg Pro Leu Ser Pro Leu Leu Asp
900 905 910

Gln Ser Thr Pro Asp Phe Thr Ala Phe Cys Ser Val Gly Glu Trp Leu
915 920 925

Gln Ala Ile Lys Met Glu Arg Tyr Lys Asp Asn Phe Thr Ala Ala Gly
930 935 940

Tyr Asn Ser Leu Glu Ser Val Ala Arg Met Thr Ile Asp Asp Val Met
945 950 955 960

Ser Leu Gly Ile Thr Leu Val Gly His Gln Lys Lys Ile Met Ser Ser
965 970 975

Ile Gln Thr Met Arg Ala Gln Met Leu His Leu His Gly Thr Gly Ile
980 985 990

Gln Val